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I 


The  Research  and  Technology 
Organization  (RTO)  of  NATO 


RTO  is  the  single  focus  in  NATO  for  Defence  Research  and  Technology  activities.  Its  mission  is  to  conduct  and  promote 
cooperative  research  and  information  exchange.  The  objective  is  to  support  the  development  and  effective  use  of  national 
defence  research  and  technology  and  to  meet  the  military  needs  of  the  Alliance,  to  maintain  a  technological  lead,  and  to 
provide  advice  to  NATO  and  national  decision  makers.  The  RTO  performs  its  mission  with  the  support  of  an  extensive 
network  of  national  experts.  It  also  ensures  effective  coordination  with  other  NATO  bodies  involved  in  R&T  activities. 

RTO  reports  both  to  the  Military  Committee  of  NATO  and  to  the  Conference  of  National  Armament  Directors.  It  comprises  a 
Research  and  Technology  Board  (RTB)  as  the  highest  level  of  national  representation  and  the  Research  and  Technology 
Agency  (RTA),  a  dedicated  staff  with  its  headquarters  in  Neuilly,  near  Paris,  France.  In  order  to  facilitate  contacts  with  the 
military  users  and  other  NATO  activities,  a  small  part  of  the  RTA  staff  is  located  in  NATO  Headquarters  in  Brussels.  The 
Brussels  staff  also  coordinates  RTO’s  cooperation  with  nations  in  Middle  and  Eastern  Europe,  to  which  RTO  attaches 
particular  importance  especially  as  working  together  in  the  field  of  research  is  one  of  the  more  promising  areas  of  initial 
cooperation. 

The  total  spectrum  of  R&T  activities  is  covered  by  7  Panels,  dealing  with: 

•  SAS  Studies,  Analysis  and  Simulation 

•  SCI  Systems  Concepts  and  Integration 

•  SET  Sensors  and  Electronics  Technology 

•  1ST  Information  Systems  Technology 

•  AVT  Applied  Vehicle  Technology 

•  HFM  Human  Factors  and  Medicine 

•  MSG  Modelling  and  Simulation 

These  Panels  are  made  up  of  national  representatives  as  well  as  generally  recognised  ‘world  class’  scientists.  The  Panels  also 
provide  a  communication  link  to  military  users  and  other  NATO  bodies.  RTO’s  scientific  and  technological  work  is  carried 
out  by  Technical  Teams,  created  for  specific  activities  and  with  a  specific  duration.  Such  Technical  Teams  can  organise 
workshops,  symposia,  field  trials,  lecture  series  and  training  courses.  An  important  function  of  these  Technical  Teams  is  to 
ensure  the  continuity  of  the  expert  networks. 

RTO  builds  upon  earlier  cooperation  in  defence  research  and  technology  as  set-up  under  the  Advisory  Group  for  Aerospace 
Research  and  Development  (AGARD)  and  the  Defence  Research  Group  (DRG).  AGARD  and  the  DRG  share  common  roots 
in  that  they  were  both  established  at  the  initiative  of  Dr  Theodore  von  Karman,  a  leading  aerospace  scientist,  who  early  on 
recognised  the  importance  of  scientific  support  for  the  Allied  Armed  Forces.  RTO  is  capitalising  on  these  common  roots  in 
order  to  provide  the  Alliance  and  the  NATO  nations  with  a  strong  scientific  and  technological  basis  that  will  guarantee  a 
solid  base  for  the  future. 

The  content  of  this  publication  has  been  reproduced 
directly  from  material  supplied  by  RTO  or  the  authors. 
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Databases  for  Assessment  of  Military  Speech 
Technology  Equipment 

(RTO  TR-25) 

Executive  Summary 

(i)  A  NATO  research  group  carries  out  collaborative  studies  on  military  applications  of  speech  processing. 
A  major  requirement  in  this  area  of  work  is  for  large  quantities  of  speech  recordings  made  in  military 
environments,  which  are  often  expensive  and  difficult  to  obtain.  Research  and  development  in  this  area  will 
benefit  from  sharing  such  data  as  widely  as  possible  among  the  NATO  research  community. 

MAJOR  RECOMMENDATIONS 

(ii)  The  NATO  research  group  on  speech  processing  should  continue  to  collate  and  disseminate  information 
about  available  speech  databases  of  relevance  to  research  and  development  of  military  speech  technology. 

MILITARY  IMPLICATIONS 

(iii)  The  cost  of  collecting  speech  recordings  under  realistic  military  conditions  is  high.  Considerable  cost 
savings  may  be  made  if  such  data  are  shared  as  widely  as  possible  amongst  the  NATO  community.  Robust 
performance  under  field  conditions  will  also  be  improved  by  exposure  to  a  wide  variety  of  speech  during 
development. 

FURTHER  WORK 

(iv)  The  NATO  research  study  group  on  speech  and  language  technology  (IST-TG001)  will  continue  to 
maintain  and  update  the  database  of  speech  recordings  relevant  to  military  applications  of  speech 
technology.  Further  ways  of  disseminating  this  information  will  be  sought,  including  electronic  means  such 
as  the  Internet. 


Les  bases  de  donnees  pour  revaluation  des 
equipements  de  technologie  vocale  militaire 

(RTO  TR-25) 


Synthese 

(i)  Un  groupe  de  recherche  OTAN  effectue  des  etudes  sur  les  applications  militaires  du  traitement  de  la 
parole.  Dans  ce  domaine  il  faut  de  grandes  quantites  d’enregistrements  effectues  en  environnement 
militaire,  enregistrements  qui  sont  souvent  couteux  et  difficiles  a  obtenir.  La  recherche  et  le  developpement 
ne  peuvent  que  beneficier  d’un  partage  aussi  large  que  possible  de  telles  ressources  au  sein  des  pays  de 
l’OTAN. 

RECOMMANDATIONS  MAJEURES 

(ii)  Le  groupe  de  recherche  OTAN  etudiant  le  traitement  de  la  parole  doit  continuer  a  collecter  et  a 
disseminer  1’ information  sur  les  bases  de  donnees  de  parole,  disponibles  et  pertinentes,  pour  la  recherche  et 
le  developpement  des  technologies  vocales  militaires. 

ENJEUS  MILITAIRES 

(iii)  Le  cout  de  la  collecte  d’enregistrements  de  parole  dans  des  conditions  militaires  realistes  est  eleve.  Des 
economies  considerables  peuvent  etre  realisees  si  de  telles  donnees  sont  partagees  aussi  largement  que 
possible  au  sein  de  la  communaute  OTAN.  La  robustesse  des  systemes  en  conditions  reelles  sera  aussi 
amelioree  grace  a  la  confrontation  a  une  grande  variete  de  telles  donnees  pendant  leur  developpement. 

PERSPECTIVES 

(iv)  Le  groupe  de  recherche  OTAN  etudiant  le  traitement  de  la  parole  continuera  a  tenir  a  jour  la  base  de 
donnees  d’enregistrement  de  parole  pour  1’evaluation  des  equipements  de  technologie  vocale  militaire.  Des 
voies  supplementaires  de  diffusion  de  cette  information  seront  recherchees,  y  compris  les  moyens 
electroniques  tels  quTntemet. 
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Preface 


Speech  technology  has  the  potential  to  be  of  great  use  in  many  areas  of  military  operations.  Large  quantities  of 
realistic  speech  recordings  are  a  necessity  for  research  in  this  area,  and  for  the  assessment  of  techniques  and 
equipment.  Collection  of  such  speech  recordings  is  usually  expensive  and  time-consuming,  so  considerable 
savings  may  be  made  if  such  data  are  shared  between  users  in  NATO  countries.  This  work  was  started  by  the 
former  DRG  Research  Study  Group  (RSG10). 

This  report  describes  a  database  maintained  by  the  Task  Group  on  Speech  and  Language  Technology  of  the  RTO 
Information  Systems  Technology  Panel.  The  report  contains  details  of  speech  recordings  relevant  to  military 
operations,  which  may  be  made  available  to  NATO  partners.  The  aim  of  this  report  is  to  increase  awareness  of 
this  database,  so  that  the  benefits  of  sharing  the  recordings  may  be  maximized. 


Preface 

Les  technologies  vocales  ont  le  potentiel  d’etre  tres  utiles  dans  de  nombreux  domaines  des  operations  militaires. 
De  grandes  quantites  d’enregistrements  realistes  sont  necessaires  pour  la  recherche  dans  ce  domaine  et  pour 
1’evaluation  des  techniques  et  des  equipements.  La  collecte  de  tels  enregistrements  est  generalement  couteuse  et 
consommatrice  de  temps,  et  des  economies  substancielles  pourraient  etre  faites  si  de  telles  donnees  etaient 
partagees  entre  les  utilisateurs  des  pays  de  l’OT AN.  Ce  travail  a  ete  initid  par  l’ancien  groupe  de  recherche  et 
d’etude  RSG10. 

Ce  rapport  decrit  une  base  de  donnees  tenue  a  jour  par  le  groupe  sur  le  traitement  de  la  parole  et  du  langage, 
groupe  issu  de  la  commission  RTO  sur  les  technologies  des  systemes  d’information  (1ST).  II  foumit  des  details 
sur  les  enregistrements  de  parole  utiles  pour  les  operations  militaires,  qui  sont  disponibles  pour  les  partenaires  de 
LOT AN.  Le  but  de  ce  rapport  est  de  mieux  faire  connaitre  cette  base  de  donnees,  de  maniere  a  beneficier  au 
mieux  des  possibilites  de  partager  ces  enregistrements. 


Foreword 


Efficient  speech  communication  is  recognized  as  a  critical  and  instrumental  capability  in  many  military 
applications  such  as  command  and  control,  aircraft  and  vehicle  operations,  military  communication,  translation, 
intelligence,  and  training.  The  former  NATO  research  study  group  on  speech  processing  (AC243  (Panel  3) 
RSG.10)  conducts  since  its  establishment  in  1978  experiments  and  surveys  focused  on  military  applications  of 
language  processing.  Presently  the  work  is  performed  by  the  1ST  Task  Group  001.  Guided  by  its  mandate,  the 
former  RSG.10  initiated  in  the  past  the  publication  of  overviews  on  potential  applications  of  speech  technology 
for  military  use  and  also  organized  several  workshops  and  lecture  series  on  military-relevant  speech  technology 
topics.  In  recent  years,  the  speech  R&D  community  has  developed  or  enhanced  many  technologies  which  can 
now  be  integrated  into  a  wide-range  of  military  applications  and  systems.  Development  and  assessment  of 
speech  technology  for  military  applications  requires  representative  speech  material.  In  the  past  many  data  bases 
have  been  collected  and  distributed  on  various  means  such  as  CD-ROM.  This  report  gives  an  overview  of 
representative  databases  for  military  speech  research. 
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1.  INTRODUCTION 

The  former  Research  Study  Group  on  Speech  Processing  (AC243/Panel  3/RSG.10)  was 
set  up  in  1978  in  order  to  address  speech  processing  issues  of  interest  to  military  system  designers. 
The  Group  has  since  conducted  collaborative  projects  on  isolated  and  connected  word  recognition 
(Bridle,  1983),  recognition  in  a  multi-lingual  environment  (Moore  1988),  and  recognition  in 
additive  noise  (Steeneken  and  Varga  1993,  Gagnon  and  Cupples  1995).  Workshops  have  been 
organised  on  dialogue  structures  (Taylor,  Neel,  and  Bouwhuis,  eds.  1989),  Applications  of  Speech 
Technology  (Mangold,  Hunt,  and  Neel,  1993).,  and  Speech  under  Stress  (Moore  and  Trancoso, 
1995).  Two  major  reports  have  been  produced  on  the  military  applications  of  speech  technology 
(Weinstein  1991,  Steeneken  (ed.)  1996.).  Presently  the  work  is  continued  by  the  RTO-IST  Task 
group  (AC  232/IST/TG001). 

A  by-product  of  some  of  these  projects  has  been  the  creation  of  databases  of  speech 
recordings  suitable  for  the  assessment  of  the  performance  of  speech  technology  equipment  under 
military  conditions.  In  addition,  speech  databases  have  been  created  as  part  of  the  work  of  various 
laboratories  working  with  speech  technology  for  military  applications.  Many  of  these  databases 
can  be  made  available  to  other  researchers  within  the  NATO  countries,  so  RSG.10  has  maintained 
a  list  of  them  for  several  years.  The  purpose  of  this  report  is  to  increase  awareness  of  this  list  in 
order  to  facilitate  exchange  of  information  and  resources  between  NATO  countries. 


2.  MILITARY  BENEFIT 


Speech  and  language  technology  has  many  potential  applications  in  military  operations, 
including  command  and  control,  intelligence,  man-machine  interface,  machine  translation,  and 
others  (Steeneken  (ed)  1996).  A  general  problem  in  research  and  development  in  speech 
technology  is  the  availability  of  suitable  databases  of  speech  recordings  for  assessment  of  the 
equipment.  These  databases  need  to  be  as  realistic  as  possible,  to  recreate  the  many  effects  that 
stresses  in  the  military  environment  may  have  on  speech  production.  However,  the  collection  of 
realistic  recordings  of  military  speech  is  often  expensive  (for  example,  in  fast-jet  cockpits)  and 
usually  requires  a  considerable  effort  to  process  the  recordings  into  computer-readable  fomtat 
afterwards.  For  this  reason,  such  resources  are  valuable  and  considerable  cost  savings  may  be 
made  by  sharing  them  between  researchers  where  possible. 

The  field  performance  of  speech  technology  equipment  should  also  benefit  in  the  long 
run.  Robust  performance  under  changing  conditions  is  a  necessity  for  successful  military 
application,  and  exposure  to  a  wider  variety  of  speech  data  during  development  will  encourage 
this. 


3.  THE  DATABASE  LISTING 


3.1  Structure 

The  database  listing  is  maintained  as  a  Microsoft  Access™  database  consisting  of  two 
linked  tables.  The  main  table  contains  details  of  the  speech  recordings,  and  a  second  table  contains 
details  of  the  institutions  which  produced  them  and  a  person  to  contact.  Separate  tables  are  used  to 
avoid  duplication  of  data;  only  a  limited  number  of  organisations  produce  speech  databases  for 
military  applications,  so  that  a  single  entry  in  the  contacts  table  may  be  referred  to  by  several 
entries  in  the  database  table.  Full  details  of  the  database  structures  are  given  in  Annex  A.  In  many 
cases,  there  is  no  information  in  some  of  the  less  important  fields. 
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3.2  Inclusion  Criteria 

The  criteria  for  inclusion  in  the  listing  are: 

•  Military  relevance 

•  Availability  to  other  researchers  within  NATO. 

Military  relevance  may  result  from  the  vocabulary,  noise  background,  microphone  type, 
type  of  stress  on  the  speaker,  multi-linguality,  or  any  other  characteristic  which  may  arise  in  the 
context  of  military  usage  of  speech  technology.  Availability  may  be  as  a  result  of  open  publication 
or  by  mutual  agreement  between  the  database  producer  and  an  institution  wishing  to  use  it. 

Recordings  of  noises  encountered  in  military  situations  are  also  included  because  of 
their  direct  relevance  to  the  performance  of  speech  technology  equipment. 

Information  about  other  speech  and  language  databases  (not  intended  specifically  for 
military  applications)  is  available  from  the  Linguistic  Data  Consortium  and  the  European 
Language  Resources  Association.  Contact  details  for  these  organisations  are  given  in  Annex  C. 

3.3  Report  Formats 

Three  formats  of  reports  are  available,  differing  in  the  amount  of  detail  supplied.  The 
first  gives  only  a  list  of  contents  of  the  database,  with  the  title,  language  and  year  of  creation  of 
each  entry.  The  second  gives  a  summary  of  the  main  fields  of  each  entry,  while  the  third  gives  full 
details  of  all  entries.  A  copy  of  the  current  full  report  is  included  at  Annex  B. 

3.4  Updating 

The  listing  is  maintained  at  the  Defence  Evaluation  and  Research  Agency  (DERA), 
Famborough,  UK,  by  the  author  of  this  report,  who  is  necessarily  dependent  on  the  database 
producers  for  details  of  the  recordings  and  for  awareness  of  their  existence.  The  intention  is  to 
produce  an  update  twice  each  year,  if  new  information  has  been  received. 

3.5  Dissemination  of  the  Listing 

A  copy  of  the  database  and  the  reports  produced  from  it  are  available  on  the  RSG.10  ftp 
server  which  may  be  accessed  via  anonymous  login  at: 

site:  ftp.tm.tno.nl 

username:  rsginfo  (no  password  required). 

These  files  will  be  updated  about  every  six  months,  providing  that  new  information  has 
been  added  to  the  database.  Copies  may  also  be  obtained  from  the  author  at  the  address  given  in 
Annex  B. 


Other  information  relating  to  Speech  Under  Stress  may  be  obtained  from  a  site  on  the 
World  Wide  Web  maintained  by  the  Robust  Speech  Processing  Laboratory  at  Duke  University, 
North  Carolina,  USA.  The  address  is 

http:/www.ee.duke.edu/Research/Speech/stress.html. 
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4.  CONCLUSION 


Speech  technology  has  considerable  potential  benefits  for  military  operations,  but 
assessment  of  systems  under  realistic  conditions  requires  large  corpora  of  speech  recordings  which 
are  expensive  to  collect.  Significant  savings  may  be  made  if  speech  corpora  can  be  shared  with 
other  potential  users  within  NATO.  This  report  has  described  a  database  of  information  relating  to 
speech  recordings  of  military  relevance,  with  the  aim  of  making  potential  users  aware  of  what  is 
available.  The  database  is  maintained  at  DERA  Famborough,  UK,  on  behalf  of  the  NATO 
Research  Study  Group  on  Speech  Processing  (AC243/Panel  3/RSG.10).  At  the  time  of  writing,  the 
details  of  40  speech  and  noise  corpora  are  included. 
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ANNEX  A.  Database  Structures 


SPEECHDB  table: 

Field  name 

Type 

Size 

Properties 

SerialNo 

Number 

Int. 

Required,  Index  field 

Name 

Text 

50 

Req 

Brief  Description 

Text 

100 

Req 

Brief  Purpose 

Text 

100 

Req 

Date  of  creation 

Number 

Int. 

Req,  Index  field,  (year) 

Material 

Text 

50 

Req 

Language 

Text 

20 

Req 

Quantity 

Number 

Int 

(recording  time  in  hours) 

No.  of  Speakers 

Number 

Int 

Gender  of  speakers 

Text 

6 

(MALE,  FEMALE,  or  BOTH) 

Native  Language 

Text 

20 

(defaults  to  Language) 

Recording  Medium 

Text 

30 

Req 

Sampling  Rate 

Text 

10 

Microphone 

Text 

30 

Contact  Number 

Number 

Int 

Req,  Index,  Link-CONTACTS 

Author  1 

Text 

50 

(of  report  describing  database) 

Author  2 

Text 

50 

Author  3 

Text 

50 

Report  Title 

Text 

100 

Year  of  Pub. 

Number 

Int 

(Year  of  publication) 

Publisher 

Text 

50 

(Organisation,  Book,  Journal,  etc.) 

Report  Number 

Text 

50 

Availability 

Text 

30 

(Open,  restricted,  etc.) 

Users 

Text 

30 

(Institutions  known  to  be  using  it) 

Applications 

Text 

30 

Req  (Application  intended  by  producer) 

Full  Description 

Text 

255 

(More  detailed  description) 

Annotation 

Text 

30 

(Format  of  labelling) 

Aux.  Info. 

Text 

50 

Signal  type 

Text 

30 

(eg  waveform,  LPC  params) 

Date  info  entered 

Date 

(17) 

Long  date 
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CONTACTS  table: 


Field  name 

Type 

Size 

Name 

Text 

30 

Department 

Text 

30 

Institution 

Text 

50 

Abbr  Inst 

Text 

15 

Street 

Text 

50 

Town 

Text 

40 

State 

Text 

40 

Postcode 

Text 

15 

Country 

Text 

40 

Telephone 

Text 

20 

FAX 

Text 

20 

Email 

Text 

50 

Serial  Number 

Number 

int 

Properties 

Req  (Contact  person  for  database) 

Req 

(Usual  abbreviation  for  institution) 

Req 

Req 

Req,  Index  field 
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ANNEX  B.  Database  Listing 

TG001  SPEECH  DATABASE  LISTING 
Full  Report 


Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 


1  Database  Name:  Diagnostic  Rhyme  Test 

2ft  Lists  of  96  words  each 

Measurement  of  DRT  scores  of  transmission  channels 

Rhyme  Words  Selected  on  6  Phonetic  Features  Language:  Dutch 

5  Hours  NoofSpkrs:  4  Gender:  MALE  Year:  1982 

Signal:  Waveform  Sampling  Rate:  N/A 

Analogue  tape  7.5  ips  Annotation: 

Unlimited  Applications:  Comms 


Dr.  H  J  M  Steeneken 
+31  3463  56269 
steeneken  @  tm.tno.nl 


Institute:  TNO/1ZF 
Fax:  +31  3463  53977 

Date  Entered:  08  March  1994 


Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 


2  Database  Name:  Helicopter  Word  List 

Vocabulary  of  60  words,  3  flight  conditions 
Evaluation  of  automatic  speech  recognition 

Cockpit  Vocabulary  Language:  English 

2  Hours  NoofSpkrs:  4  Gender:  MALE  Year: 

Boom  Signal:  Waveform  Sampling  Rate: 

Analogue  tape  Annotation: 

Unlimited  Applications:  Aircraft 


1986 


Dr.  H  J  M  Steeneken 
+31  3463  56269 
steeneken  @  tm.tno.nl 


Institute:  TNO/IZF 
Fax:  +31  3463  53977 
Date  Entered:  08  March  1994 


Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 


3  Database  Name:  RSG.10  Noise  Database 

29  samples  of  military  noises 
To  standardise  some  noises  for  speech  research 
Noises  in  tanks,  aircraft,  ships,  etc. 

2  Hours  No  of  Spkrs:  0  Gender:  - 

Various  Signal:  Waveform 

one  CD-ROM 
Unlimited 


Dr.  H  J  M  Steeneken 
+31  3463  56269 
steeneken@tm.tno.nl 


Language:  Noise 

Year:  1990 

Sampling  Rate:  16  kHz 


Annotation:  N/A 
Applications:  Military 


Institute:  TNO/IZF 
Fax:  +31  3463  53977 
Date  Entered:  30  October  1996 


Serial  No:  4  Database  Name:  NATO-RSG.IO  Spoken  Digit  Database 

Description:  Isolated  connected  digits  in  several  languages 

Purpose:  To  compare  performance  of  connected  word  recognisers  on  native  and  non-native  speakers 
Material:  Digits  Language:  DuFrGeUKUS 

Quantity:  2ft  Hours  NoofSpkrs:  19  Gender:  BOTH  Year:  1982 

Microphone:  Signal:  Waveform  Sampling  Rate:  N/A 

Medium:  Analogue  tape  Annotation: 

Availability:  Applications:  Assessment 

Aux  Info: 

Contact:  J  S  Garofolo  Institute:  NIST 

Telephone:  Fax: 

E-mail: 


Date  Entered:  24  March  1 997 
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Serial  No:  5  Database  Name:  RSRE  1983  Speech  Database 

Description:  Isolated  and  connected  words  with  various  speaking  styles 
Purpose:  Recognition  Algorithm  development  and  testing 

Material:  Digits,  Letters,  DRT  Words,  others  Language:  UK  English 

Quantity:  100  Hours  NoofSpkrs:  15  Gender:  BOTH  Year:  1983 

Microphone:  SM-10  Signal:  Waveform  Sampling  Rate: 

Medium:  SONY  PCM  Annotation: 

Availability:  Unlimited  Applications:  Assessment 

Aux  Info: 

Contact:  J  McQuillan  Institute:  DERA/SRU 

Telephone:  +44  1684  894361  Fax:  +44  1684  894540 

E-mail:  jmq@signal.dra.hmg.gbDate  Entered:  24  March  1997 

Serial  No:  6  Database  Name:  40-Speaker  Digit  Database 

Description:  400  isolated  digits  per  speaker 
Purpose:  To  study  speaker  consistency 

Material:  Digits  Language:  UK  English 

Quantity:  10  Hours  NoofSpkrs:  40  Gender:  BOTH  Year:  1986 

Microphone:  Signal:  Waveform  Sampling  Rate: 

Medium:  SONY  PCM  Annotation: 

Availability:  Unlimited  Applications:  Research 

Aux  Info: 

Contact:  J  McQuillan  Institute:  DERA/SRU 

Telephone:  +44  1684  894361  Fax:  +44  1684  894540 

E-mail:  jmq@signal.dra.hmg.gb Date  Entered:  24  March  1997 


Serial  No:  7  Database  Name:  DUR-Words 

Description:  word  pairs 

Purpose:  To  study  durational  clues 

Material:  11  minimally  distinct  word  pairs  Language:  UK  English 

Quantity:  1  Hours  NoofSpkrs:  ?  Gender:  MALE  Year:  1982 

Microphone:  Signal:  Channel  vocoder  dataSampling  Rate: 


Medium:  Computer  files 
Availability:  Unlimited 
Aux  Info: 

Contact:  J  McQuillan 
Telephone:  +44  1684  894361 

E-mail:  jmq@signal.dra.hmg.gb 


Annotation: 

Applications:  Research 

Institute:  DERA/SRU 
Fax:  +441684  894540 

Date  Entered: 


08  March  1994 


Serial  No:  8  Database  Name:  POLS  Noise  Tape 

Description:  Speech  plus  speech  spectrum  noise  at  various  SNRs 
Purpose:  Assessment 

Material:  Isolated  and  Connected  Digits  Language:  English 

Quantity:  1  Hours  NoofSpkrs:  1  Gender:  MALE  Year: 


1983 


Microphone: 

Medium:  SONY  PCM 
Availability:  Unlimited 
Aux  Info: 

Contact:  Dr.  H  J  M  Steeneken 
Telephone:  +31  3463  56269 
E-mail:  steeneken@tm.tno.nl 


Signal:  Waveform  Sampling  Rate: 

Annotation: 

Applications:  Assessment 

Institute:  TNO/IZF 
Fax:  +31  3463  53977 

Date  Entered:  08  March  1994 
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Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 


9  Database  Name:  POLS  Babble  Tape 

Speech  +  STl  l'EL  noiseat  various  SNRs 
Assessment 

Digits  Language:  English 

1  Hours  No  of  Spkrs:  1  Gender:  MALE  Year: 

Signal:  Waveform  Sampling  Rate: 

SONY  PCM  Annotation: 

Unlimited  Applications:  Research 


Dr.  H  J  M  Steeneken 
+31  3463  56269 
steeneken  @  tm.tno.nl 


Institute:  TNO/IZF 
Fax:  +31  3463  53977 
Date  Entered:  08  March  1994 


1987 


Serial  No:  10  Database  Name:  BAC  111  Recordings 
Description:  Recordings  made  in  the  cockpit  of  a  BAC  111  airliner 
Purpose:  Research 

Material:  Digits  and  Words  from  RSRE  1983  Database  Language:  UK  English 

Quantity:  4  Hours  No  of  Spkrs:  12  Gender:  ?  Year:  1984 

Microphone:  Signal:  Waveform  Sampling  Rate: 

Medium:  SONY  PCM  Annotation: 

Availability:  Unlimited  Applications:  Aircraft 

Aux  Info: 

Contact:  J  McQuillan  Institute:  DERA/SRU 

Telephone:  +44  1684  894361  Fax:  +44  1684  894540 

E-mail:  jmq@signal.dra.hmg.gb  Date  Entered:  08  March  1994 


Serial  No:  11  Database  Name:  Speech  Recordings  in  Buccaneer  Cockpit  Noise 
Description:  Recorded  in  noise  simulator  with  116  dB  of  Buccaneer  spectrum  noise, 

Purpose:  Evaluation  of  ASR1000  speech  recogniser 

Material:  Digits,  DRT  Words  and  common  Telephone  Words  Language:  UK  English 
Quantity:  20  Hours  No  of  Spkrs:  5  Gender:  MALE  Year:  1986 

Microphone:  Oxygen  mask  Signal:  Waveform  Sampling  Rate: 

Medium:  SONY  PCM  Annotation: 

Availability:  Unlimited  Applications:  Fast-jet  aircraft 

Aux  Info: 

Contact:  A  J  South  Institute:  DERA/Farnboro’ 

Telephone:  +44  1252  392496  Fax:  +44  1252  393091 


E-mail:  ajsouth@dra.hmg.gb 


Date  Entered:  24  March  1997 


Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 


12  Database  Name:  Noise-in-Ears  Database 

Recorded  with  90dB  noise-in-ears,  and  noise  mixed  with  speech  at  defined  SNRs 
Evaluation  of  ASR1000  speech  recogniser 

Digits  Language:  UK  English 

36  Hours  No  of  Spkrs:  6  Gender:  MALE  Year:  1987 

Oxygen  mask  Signal:  Waveform  Sampling  Rate: 

SONY  PCM  Annotation: 

Unlimited  Applications:  Fast-jet  aircraft 

A  J  South  Institute:  DERA/Farnboro' 

+44  1252  392496  Fax:  +44  1252  393091 

ajsouth@dra.hmg.gb  Date  Entered:  24  March  1997 
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Serial  No:  13  Database  Name:  Speaker-Independent  Connected  Speech  Database 
Description:  Various  isolated  and  connected  utterances  from  200  word  vocab 
Purpose:  Evaluation  of  automatic  speech  recognition 

Material:  Digit  Strings,  Alphabet,  &  Phrases  Language:  US  English 

Quantity:  21  Hours  No  of  Spkrs:  46  Gender:  BOTH  Year: 

Microphone:  Signal:  Waveform  Sampling  Rate: 

Medium:  Analogue  cassette  tape  Annotation: 

Availability:  Unlimited  Applications:  Assessment 

Aux  Info: 

Contact:  E  J  Cupples  Institute:  Rome  Labs 

Telephone:  +1  315  330  4024  Fax:  +1  315  330  2728 

E-mail:  cupples@rl.af.mil  Date  Entered:  24  March  1997 


1985 


Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 


14  Database  Name:  RADC  Language  Identification  Database 
Read  text  in  7  languages  in  several  quiet  environments 

Development,  test  and  evaluation  of  language  identification  algorithms  and  techniques 


Text 

Language:  Several 

50  Hours  No  of  Spkrs: 

131  Gender:  ? 

Year:  1979 

Signal:  Waveform 

Sampling  Rate: 

Analogue  tape  7.5  ips 

Annotation: 

On  special  request 

Applications: 

Military 

E  J  Cupples  Institute:  Rome  Labs 

+1315  330  4024  Fax:  +1315  330  2728 

cupples@rl.af.mil  Date  Entered:  24  March  1997 


Serial  No:  15  Database  Name:  ARPA  Voice  Authentication  Database 
Description:  Conversational  speech  over  telephone  channels 

Purpose:  Development,  test  and  evaluation  of  speaker  identification  algorithms  and  techniques 
Material:  Free  Speech,  Read  Sentences,  CVs  &  Keywords  Language:  US  English 


Quantity:  43  Hours  No  of  Spkrs:  17 
Microphone:  Signal: 

Medium:  Analogue  tape 
Availability: 

Aux  Info: 

Contact:  S  Smith 

Telephone:  Fax: 

E-mail: 


Gender:  BOTH  Year:  1978 

Waveform  Sampling  Rate:  8000 

Annotation: 

Applications:  Speaker  identification 

Institute:  Rome  Labs 

Date  Entered:  09  March  1994 


Serial  No:  16  Database  Name:  Air  Force  Academy  Database 
Description:  Phonetic  alphabet,  digits,  and  seven  sentences  (2  standard,  5  randomly  selected) 

Purpose:  Evaluation  of  speaker-independent  recognition 

Material:  Digits,  Phonetice  Alphabet,  Sentences  Language:  US  English 

Quantity:  50  Hours  No  of  Spkrs:  635  Gender:  BOTH  Year:  1987 

Microphone:  Capacitor  &  Noise  Cancelling  Signal:  Waveform,  Lx  Sampling  Rate: 

Medium:  BETA  Format  PCM  Annotation: 

Availability:  Unlimited  Applications:  Recognition 

Aux  Info:  demographic  info  on  subjects 
Contact:  E  J  Cupples  Institute:  Rome  Labs 

Telephone:  +1  315  330  4024  Fax:  +1  315  330  2728 

E-mail:  cupples@rl.af.mil  Date  Entered:  24  March  1997 
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Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 


17  Database  Name:  CVC  Word  lists 
CVCs  using  common  phonemes  in  Dutch 

Intelligibility  measurements  of  communications  systems  and  room  acoustics 
CVC  words  in  carrier  phrases  Language:  Dutch 

20  Hours  NoofSpkrs:  8  Gender:  BOTH  Year:  1990 

1/2"  Condenser  Signal:  Waveform  Sampling  Rate:  48  kHz 

DAT  Annotation: 

Unlimited  Applications:  Testing  communication  systems 


Dr.  H  J  M  Steeneken 
+31  3463  56269 
steeneken  @  tm.tno.nl 


Institute:  TNO/IZF 
Fax:  +313463  53977 
Date  Entered:  24  March  1997 


Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 

Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 


18  Database  Name:  DCIEM  Military  Vehicle  Noises 
Many  noises  recorded  in  military  vehicles  of  all  kinds 
Noise  Survey 

Noises  Language:  Noise 

2  Hours  No  of  Spiers:  0  Gender:  -  Year:  1975 

Free-Field,  +  Electret  Signal:  Waveform  Sampling  Rate:  N/A 

Analogue  tape  7.5  ips  Annotation: 

Applications;  Military 


B  Crabtree 


Institute:  DCIEM 


Fax: 


Date  Entered:  09  March  1994 


19  Database  Name:  EUROM-O 

Digits  and  Speech  in  5  languages,  4  speakers  each 

Multi-lingual  speech  input/output  assessment 

Isolated  digits,  digit  triples,  continuous  passage  Language:  DaDuFrltUK 
5  Hours  NoofSpkrs:  20  Gender:  BOTH  Year:  1988 

B&E  1/2"  condenser,  type  4134Signal:  Speech  w'aveform,  Sampling  Rate:  16  kHz 
CD-ROM  Annotation:  available  separately 

from  ESPRIT-S AM  partners  Applications:  Assessment 

German  available  separately 

Prof  S  Rosen  Institute:  UCL 

Fax: 

Date  Entered:  01  November  1996 


Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 


20  Database  Name:  NOISE-ROM  -0 

Extended  version  of  RSG.10  Noise  database  (entry  No  4) 

Standard  set  of  noises  for  speech  research 

Various  noises,  mainly  military  vehicles  Language:  Noise 

2  Hours  NoofSpkrs:  0  Gender:  -  Year:  1990 

Various  Signal:  Waveform  Sampling  Rate:  20  kHz 

CD-ROM  Annotation: 

Media  charge  Applications:  Military 


Dr.  H  J  M  Steeneken 
+31  3463  56269 
steeneken@tm.tno.nl 


Institute:  TNO/IZF 
Fax:  +31  3463  53977 

Date  Entered: 


24  March  1997 


Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 

Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 

Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 

Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 


21  Database  Name:  DARPA  TIMIT  Acoustic-Phonetic  Speech  Database 
TIMIT  training  data 
Recogniser  assessment 


Sentences 

2  Hours  No  of  Spkrs: 

high  quality 

CD-ROM 

Unlimited 

Documentation  on  CD-ROM 
Linguistic  Data  Corporation 
+1  215  898-0464 
ldc@ldc.upenn.edu 


Language:  US  English 

No  of  Spkrs:  ?  Gender:  ?  Year:  1988 

Signal:  Waveform  Sampling  Rate:  16  kHz 

Annotation:  Orthographic  and  phonetic 
Applications:  Various 


Institute:  LDC 
Fax:  +1  215  573-2175 

Date  Entered: 


30  March  1994 


22  Database  Name:  DARPA  Resource  Management  Database 
Naval  resource  management  task,  continuous  speech 
Assessment  of  large  vocabulary  continuous  speech  recognisers 
Dialect  calibration,  training  and  test  sentences  Language:  US  English 

15  Hours  No  of  Spkrs:  160  Gender:  BOTH  Year:  1990 

Sennheiser  HMD-414  Signal:  Waveform  Sampling  Rate:  16  kHz 

CD-ROM  Annotation: 

Applications:  Military 


Linguistic  Data  Corporation 
+1  215  898-0464 
ldc@ldc.upenn.edu 


Institute:  LDC 
Fax:  +1  215  573-2175 
Date  Entered:  24  March  1 997 


23  Database  Name:  1989  RAE  Tornado  Speech  Database 
Recorded  in  rear  seat  of  a  Tornado  under  various  flight  conditions. 

Development  and  evaluation  of  speech  recognisers  for  military  fast  jets. 

Digits,  digit  triples,  and  command  phrases  Language:  UK  English 

30  Hours  No  of  Spkrs:  6  Gender:  MALE  Year:  1989 

RAF  Oxygen  mask  Signal:  Speech  waveform,  Lx,  Sampling  Rate:  32  kHz 

DAT  Annotation: 

NATO  Restricted  Applications:  Fast-jet  aircraft 

Ground  training  data  included. 

A  J  South  Institute:  DERA/Farnboro' 

+44  1252  392496  Fax:  +44  1252  393091 

ajsouth@dra.hmg.gb  Date  Entered:  24  March  1997 

24  Database  Name:  NOISEX-92 
Speech  with  noise  ADDED  at  various  SNRs 
Comparative  experiments  on  recognition  in  additive  noise. 


Digits  and  digit  triples 

5  Hours  No  of  Spkrs:  2 
SM-10 
CD-ROM 
Unlimited 

2  Speakers  from  EUROM-O 
J  McQuillan 
+44  1684  894361 
jmq@signal.dra.hmg.gb 


Language:  UK  English 
Gender:  BOTH  Year:  1992 

Signal:  waveform  Sampling  Rate:  16  kHz 

Annotation:  SAM  format 
Applications:  Recognition 


Institute:  DERA/SRU 
Fax:  +44  1684  894540 

Date  Entered: 


20  April  1994 


B-7 


Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 


25  Database  Name:  Isolated  digits  FDC 

One  of  a  series  of  databases  for  military  aircraft  applications 

Study  of  effects  of  G-load 

Isolated  digits  Language:  French 

1  Hours  No  of  Spkrs:  4  Gender:  BOTH  Year: 

Oxygen  mask  Signal:  waveform  Sampling  Rate: 

DAT  Annotation:  ? 

Restricted  Applications:  Fast-jet  aircraft 


C.  Gulli 

+33  5  56  13  52  25 


Institute:  Sextant 
Fax:  +33  5  5613  50  54 
Date  Entered: 


21  April  1994 


Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 


26  Database  Name:  CVCV  FDC  Words 

One  of  a  series  of  databases  for  military  aircraft  applications 

Study  of  effects  of  G  force 

Phonetically  balanced  CVCV  words  Language:  French 

1  Hours  No  of  Spkrs:  4  Gender:  BOTH  Year: 

Oxygen  mask  Signal:  Waveform  Sampling  Rate: 

DAT  Annotation:  PTR 

Restricted  Applications:  Fast-jet  aircraft 


C.  Gulli 

+33  5  56  13  52  25 


Institute:  Sextant 
Fax:  +33  5  56  13  50  54 
Date  Entered:  21  April  1994 


Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 


27  Database  Name:  ALPHAJET 

One  of  a  series  of  databases  for  military  aircraft  applications 

Assessment  of  recognition  rate  in  military  cockpit 

Cockpit  commands  (Rafale)  Language:  French 

4  Hours  No  of  Spkrs:  6  Gender:  MALE  Year: 

Oxygen  mask  Signal:  Waveform  Sampling  Rate: 

DAT  Annotation:  PTT 

Restricted  Applications:  Fast-jet  aircraft 


C.  Gulli 

+33  5  56  13  52  25 


Institute:  Sextant 
Fax:  +33  5  56  13  50  54 
Date  Entered:  21  April  1994 


Serial  No:  28  Database  Name:  MIR  3  B 
Description:  One  of  a  series  of  databases  for  military  aircraft  applications 
Purpose:  Assessment  of  recognition  rates  in  military  cockpit 
Material:  Cockpit  commands  Language:  French 

Quantity:  1  Hours  No  of  Spkrs:  4  Gender:  MALE  Year: 

Microphone:  Oxygen  mask  Signal:  Waveform  Sampling  Rate: 

Medium:  VAX  files  Annotation:  PTT 

Availability:  Restricted  Applications:  Fast-jet  aircraft 

Aux  Info:  Some  G  conditions 

Contact:  C.  Gulli  Institute:  Sextant 

Telephone:  +33  5  56  13  52  25  Fax:  +33  5  56  13  50  54 

E-mail:  Date  Entered: 


21  April  1994 


Serial  No:  29  Database  Name:  Multi-Helicare 
Description:  One  of  a  series  of  databases  for  military  aircraft  applications 
Purpose:  Assessment  of  recognition  rates  in  military  helicopter 
Material:  Avionic  sentences  Language:  French 

Quantity:  1  Hours  NoofSpkrs:  3  Gender:  MALE  Year: 

Microphone:  Signal:  Waveform  Sampling  Rate: 

Medium:  DAT  Annotation:  PTT 

Availability:  Restricted  Applications:  Helicopter 

Aux  Info:  PUMA 

Contact:  C.  Gulli  Institute:  Sextant 

Telephone:  +33  5  56  13  52  25  Fax:  +33  5  56  13  50  54 

E-mail:  Date  Entered:  21  April  1994 


Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 

Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 


30  Database  Name:  SE1  FDC 

One  of  a  series  of  databases  for  military  aircraft  applications 
Assessment  of  recogniton  rate  under  adverse  conditions 
Avionic  sentences  Language:  French 

1  Hours  No  of  Spkrs:  6  Gender:  BOTH  Year: 

Oxygen  mask  Signal:  Waveform  Sampling  Rate: 

DAT  Annotation:  PTT 

Restricted  Applications:  Fast-jet  aircraft 


C.  Gulli 

+33  5  56  13  52  25 


Institute:  Sextant 
Fax:  +33  5  56  13  50  54 
Date  Entered:  21  April  1994 


31  Database  Name:  SE2  FDC 

One  of  a  series  of  databases  for  military  aircraft  applications 
Assessment  of  recognition  rate  under  G 

Avionic  sentences  Language:  French 

1  Hours  NoofSpkrs:  6  Gender:  BOTH  Year: 

Oxygen  mask  Signal:  Waveform  Sampling  Rate: 

DAT  Annotation:  PTT 

Restricted  Applications:  Fast-jet  aircraft 


C.  Gulli 

+33  5  56  13  52  25 


Institute:  Sextant 
Fax:  +33  5  56  13  50  54 
Date  Entered:  21  April  1994 


Serial  No:  32  Database  Name:  DRA  Farnborough  Centrifuge  Recordings 
Description:  Recordings  of  digit  strings  and  command  phrases  with  various  types  of  protection 
Purpose:  Characterisation  of  effects  of  G  on  speech  production  and  recogniser  performance 
Material:  25, 5  digit  strings,  25  phrases,  11  SCRIBE  B  sente  Language:  UK  English 
Quantity:  12  Hours  NoofSpkrs:  6  Gender:  BOTH  Year:  1994 

Microphone:  Oxygen  mask  Signal:  Waveform  Sampling  Rate:  16  kHz 

Medium:  two  CD-ROMs  Annotation:  SAM  format 


Availability:  NATO  Restricted  Applications:  Fast-jet  aircraft 

Aux  Info:  5  males,  1  female 

Contact:  A  J  South  Institute:  DERA/Farnboro' 

Telephone:  +44  1252  392496  Fax:  +44  1252  393091 

E-mail:  ajsouth@dra.hmg.gb  Date  Entered:  24  March  1997 


B-9 


Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 


33  Database  Name:  Cockpit  control 
Command  strings  and  isolated  words  for  control  of  F-16 
Recogniser  evaluation 

Cockpit  control  words  (281  word  vocabulary  Language:  English 

10  Hours  NoofSpkrs:  5  Gender:  MALE  Year:  1996 

Electret,  fitted  inside  mask  Signal:  Waveform  Sampling  Rate:  48  kHz 

DAT  Annotation:  Word  level 

Applications:  Military  Fast  jet 


Dr.  H  J  M  Steeneken 
+31  3463  56269 
steeneken  @  tm.tno.nl 


Institute:  TNO/IZF 
Fax:  +31  3463  53977 

Date  Entered:  30  October  1996 


Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 


34  Database  Name:  SUSC-0 

Speech  Under  Stress  Conditions 

Analysis  of  stressed  speech  and  testing  of  systems 

Fighter  controller  dialogues,  spontaneous  cockpit  Language:  English 

3  Hours  NoofSpkrs:  12  Gender:  Male  Year:  1995 

Various  Signal:  Waveform  Sampling  Rate:  16  kHz 

one  CD-ROM  Annotation: 

Applications:  Research 


Dr.  H  J  M  Steeneken 
+31  3463  56269 
steeneken  @  tm.tno.nl 


Institute:  TNO/IZF 
Fax:  +31  3463  53977 

Date  Entered:  30  October  1996 


Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 


35  Database  Name:  Tornado  SI  training  data 
240  words  from  cockpit  tasks,  29  speakers,  oxygen  mask  and  noise-in-ears 
Training  data  for  speaker-independent  tests  on  recognisers 
Isolated  words,  some  digit  strings  Language:  UK  English 

14  Hours  NoofSpkrs:  29  Gender:  MALE  "  Year:  1996 

Oxygen  mask  Signal:  Waveform  Sampling  Rate:  48/16  kHz 

DAT  or  CD-ROM  Annotation:  SAM  format  from  PTR 

Nato  restricted  Applications:  Fast-jet  aircraft 


A  J  South 

+44  1252  392496 

ajsouth@dra.hmg.gb 


Institute:  DERA/Farnboro’ 

Fax:  +44  1252  393091 

Date  Entered:  30  October  1996 


Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 


36  Database  Name:  SUSAS 

Stressed  speech  from  fairground  rides  and  helicopters,  multi-style  speech 
Research  into  speech  under  stress 

35  aircraft  communication  words  Language:  US  English 

2  Hours  NoofSpkrs:  20  Gender:  BOTH  Year:  1995 

Signal:  Sampling  Rate: 

CD-ROM  Annotation: 

Applications:  Research 


JHL  Hansen 
+1919  660  5256 
jhlh@ee.duke.edu 


Institute:  Duke  Univ. 

Fax:  +1  919  660  5293 

Date  Entered: 


30  October  1 996 


B-10 


Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 


37  Database  Name:  DCIEM  Sleep  Deprivation  Study  Map  Task  Corpus 
HCRC  Map  Task  carried  out  during  64  hour  without  sleep,  with  drugs  or  placebo 
Part  of  a  major  study  on  effects  of  continuous  work  in  prolonged  sleep  deprivation 
Spontaneous  dialogues  on  HCRC  Map  task  Language:  English 

18  Hours  NoofSpkrs:  36  Gender:  BOTH  Year:  1994 

Shure  SM10A  Signal:  Waveform  Sampling  Rate: 

CD-ROM  Annotation:  Orthographic,  turn  onset,  sgml 

Unlimited  Applications:  Studies  of  dialogue,  etc. 


Linguistic  Data  Corporation 
+1  215  898-0464 
ldc@ldc.upenn.edu 


Institute:  LDC 
Fax:  +1215  573-2175 
Date  Entered: 


01  April  1997 


Serial  No:  38  Database  Name:  Tornado  TV-TABS 
Description:  Recordings  made  in  the  back  seat  of  Tornado  GR1 
Purpose:  Assessment  of  ASR  in  fast-jet 

Material:  Isolated  &  connected  digits,  Command  phrases  Language:  English 
Quantity:  10  Hours  NoofSpkrs:  6  Gender:  MALE  Year:  1993 

Microphone:  P/Q  Oxygen  mask  Signal:  Waveform  Sampling  Rate:  16  kHz 

Medium:  CD-ROM  or  DAT  Annotation:  SAM  format,  via  PTR 

Availability:  Applications:  Fast-jet 

Aux  Info:  Training  material  recorded  on  ground,  noise  in  ears, 

Contact:  A  J  South  Institute:  DERA/Farnboro’ 

Telephone:  +44  1252  392496  Fax:  +44  1252  393091 

E-mail:  ajsouth@dra.hmg.gb  Date  Entered:  07  April  1997 


Serial  No: 
Description: 
Purpose: 
Material: 
Quantity: 
Microphone: 

Medium: 
Availability: 
Aux  Info: 
Contact: 
Telephone: 
E-mail: 


39  Database  Name:  DERA  Car  number  plate  database 
Dictation  of  UK  car  number  plates 
Evaluation  of  ASR  under  stressed  speech 

UK  Car  numbers  with  digits  and  ICAO  alphabet  Language:  UK  English 

5  Hours  NoofSpkrs:  16  Gender:  BOTH  Year:  1995 


SM-10 

CD-ROM 

Unlimited 

Two  speed  conditions 
J  McQuillan 
+44  1684  894361 
jmq  @  signal.dra.hmg.gb 


Signal:  Waveform  Sampling  Rate: 

Annotation: 

Applications:  ASR  Assessment 

Institute:  DERA/SRU 
Fax:  +44  1684  894540 

Date  Entered:  05  June  1997 


Serial  No:  40  Database  Name:  Lynx  Simulation  database 
Description:  Helicopter  cockpit  control  phrases  recorded  in  realistic  noise  and  vibration 
Purpose:  Assessment  of  recogniser  performance  in  helicopter  environment 
Material:  DVI  command  phrases,  isolated  word  training  data  Language:  English  (UK) 

Quantity:  18  Hours  NoofSpkrs:  7  Gender:  MALE  Year:  1997 

Microphone:  Socapex  1091/G  or  Racal  D13750  Signal:  Waveform  Sampling  Rate:  16  kHz 
Medium:  CD-ROM  (or  DAT)  Annotation:  SAM  format 

Availability:  Yes  (terms  under  discussion)  Applications:  Recogniser  assessment 

Aux  Info:  Speakers  were  flying  simulated  attack  helicopter  missions  to  provide  workload  stimulation 
while  reading  lists. 

Contact:  A  J  South  Institute:  DERA/Farnboro' 

Telephone:  +44  1252  392496  Fax:  +44  1252  393091 

E-mail:  ajsouth@dra.hmg.gb  Date  Entered:  12  August  1997 


This  list  is  maintained  on  behalf  of  NATO  AC232(IST)/TG001  by:  Allan  South 

Systems  Integration  Dept.,  DERA 
Room  2067,  A5  Building,  Ively  Road, 
Farnborough,  Hampshire,  GU 1 4  0LX,  UK 


Tel:  +44  1252  39  2496  /  Fax:  +44  1252  39  3091 
e-mail:  ajsouth@dra.hmg.gb 


c- 


Annex  C.  Other  Sources  of  Information 

For  details  of  other  speech  and  language  databases  (not  specifically  military),  see: 


Linguistic  Data  Corporation 
3615  Market  Street, 

Suite  200, 

Philadelphia,  PA  19104-2608 
USA 

Tel:  +1  215  898-0464 
Fax:  +1  215  573-2175 
E-mail:  ldc@ldc.upenn.edu 
Web:  http://www.ldc.upenn.edu 


European  Language  Resources  Association: 

ELRA/ELDA 
87,  Avenue  d'ltalie, 

75013  Paris, 

France, 

Tel:  +33  1  45  86  53  00 

Fax:  +33  1  45  86  44  88 

E-mail:  elra@calvanet.calvacom.fr 

Web :  http  ://w  w  w .  icp .  grenet .  fr/ELR  A/home  .html 
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